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Abstract — We investigate the capacity of opportunistic com- 
munication in the presence of dynamic and distributed spectral 
activity, i.e. when the time varying spectral holes sensed by the 
cognitive transmitter are correlated but not identical to those 
sensed by the cognitive receiver. Using the information theoretic 
framework of communication with causal and non-causal side 
information at the transmitter and/or the receiver, we obtain 
analytical capacity expressions and the corresponding numerical 
results. We find that cognitive radio communication is robust to 
dynamic spectral environments even when the communication 
occurs in bursts of only 3 — 5 symbols. The value of handshake 
overhead is investigated for both lightly loaded and heavily 
loaded systems. We find that the capacity benefits of overhead 
information flow from the transmitter to the receiver is negligible 
while feedback information overhead in the opposite direction 
significantly improves capacity. 

I. Introduction 

Cognitive radio technology has tremendous potential for 
improving the utilization of radio spectrum. Derived from J. 
Mitola's doctoral thesis [1], a cognitive radio is an intelligent 
wireless communication system that relies on opportunistic 
communication between secondary users 1 over temporarily 
unused spectral bands that are licensed to their primary users 
[2]. It is driven by software defined radio technology which 
is in production and available now. However, the development 
of cognitive radio is still at a conceptual stage due to the 
multitude of challenges in how the radio learns and adapts 
to the local spectral activity at each end of the link. Various 
solutions seek to underlay, overlay or interweave [3] the 
secondary users' signals with the primary users in a way that 
the primary users of the spectrum are as unaffected as possible 
[4]. A cognitive radio user may co-exist with the primary 
users either on a not-to-interfere basis or on an easement basis 
[5] which allows secondary transmissions as long as they are 
below the acceptable interference temperature [6]. 

The physical separation of the cognitive radio transmitter 
and receiver leads to different perspectives on the respective 
local spectral activity. In general the spectral holes sensed 
by the transmitter of a cognitive radio may not be identical 
to those sensed by the corresponding receiver. One solution 
around this problem is to have an initial handshake between 
the transmitter and the receiver in the form of an RTS and 
CTS (request-to-send and clear-to-send) exchange before the 

'We use the terms cognitive radio users and secondary users interchange- 
ably. 



beginning of communication. Such a handshake requires an 
overhead on the forward link for the RTS signal and on the 
reverse link for the CTS signal. For the relatively stationary or 
slowly changing spectral usage scenarios initially envisioned 
for cognitive radio applications the overhead may be small. 
However, in order to understand the ultimate performance 
limits and thus the potential future applications for cognitive 
radio it is necessary to explore the fundamental limitations on 
its capacity in a distributed and dynamic spectral environment. 

II. System and Channel Model 

Figure ^ is a conceptual depiction of a cognitive radio 
link. The white nodes marked T and R are the cognitive 
radio transmitter and receiver respectively while the black 
nodes marked A, B and C are primary users of the spectrum. 
Given the nature of wireless propagation, spectral activity can 
be sensed only within a certain locality. The dotted regions 
around the cognitive radio transmitter and receiver represent 
the respective sensing regions. For the simplified scenario 
shown in Fig. ^ cognitive transmitter T can sense when 
primary users A or B are active and cognitive receiver R can 
sense when primary users B or C are active. Accordingly, 
receiver R detects spectral holes when B and C are inactive, 
while transmitter T detects spectral holes when A and B are 
inactive. As a consequence, the communication opportunities 
detected at the transmitter T and receiver R are in general 
correlated but not identical. 




Fig. 1 . Different Perspectives on Local Spectral Activity at Cognitive Radio 
(Secondary) Transmitter T and Receiver R 

We are interested in how the capacity of cognitive radio 
is affected by the distributed and dynamic nature of the 



spectral environment. These two notions are further explained 
as follows. 

• Distributed: We use the term distributed to indicate the 
different views of local spectral activity at the cognitive 
transmitter T and receiver R as shown in Fig. ^ More 
precisely we are interested in the correlation of the 
communication opportunities detected at the transmitter 
and the receiver. The smaller this correlation, the more 
distributed the system is. A more distributed system 
has less overlap in the sensing regions of the cognitive 
transmitter and receiver. 

• Dynamic: We use the term dynamic to indicate the time 
variation of the spectral activity of the primary users. 
Thus, a more dynamic system is one where the spectral 
activity changes faster and is less predictable. 

Intuitively, we expect that as the system becomes more 
distributed and more dynamic, the capacity will decrease. 
Our goal in this paper is to make this intuition precise and 
quantifiable. 

The capacity of the system shown in Fig. ^ depends on a 
number of variables such as the relative location of the primary 
and secondary users and the algorithm used to detect the 
transmission opportunities. As a first step, we start with a basic 
parametric model for which we compute the capacity. The 
values of the parameters corresponding to the peculiarities of 
various physical scenarios can be computed separately in order 
to match the capacity results to specific system configurations. 

A. Two Switch Channel Model 

We use the notion of distributed side information to capture 
the localized spectral activity estimates at the transmitter and 
the receiver. The system depicted in Fig.n can be reduced to 
the switched channel model shown in Fig. [2] The input X is 




St Sr 
Fig. 2. Equivalent Channel Model 



related to the output Y as 

Y = (XS T + N)S R (1) 

where N is additive white Gaussian noise (AWGN), and 
St, Sr € {0, 1} are binary random variables modeled 
as switches that represent the communication opportunities 
sensed at the transmitter and receiver. St = 0, i.e., the 
transmitter switch is open whenever the transmitter T senses 
that a primary user is active in its sensing region. Thus the 
only transmission opportunities for the transmitter correspond 
to St — 1, when the switch is closed. Similarly, Sr = 0, or 
the receiver switch is open when the receiver sees interference 
from active primary users in its sensing region. Thus, the 
receiver in this case discards the channel output when it is 
not perceived to be a communication opportunity. From the 



receiver's viewpoint, the only communication opportunities 
correspond to Sr = 1, when the receive switch is closed. 

The key observation here is that the switch state St is 
known only to the transmitter and the switch state Sr, is known 
only to the receiver. Therefore, the channel of Fig. |2] corre- 
sponds to communication with distributed side information. 
Note that the distributed nature of the system is captured in 
the correlation of St and Sr while the dynamic nature is 
captured in the rate at which the switches change state. We 
assume that the temporal variation of the switch states follows 
a block static model. In other words, the switches retain then- 
state for a period of T c channel uses (one block) after which 
they change to an independent identically distributed (i.i.d.) 
state. 

The transmit power constraint associated with the channel 
of Fig. |2] is 

E [\X\ 2 S T ] < P, (2) 

since no power need be transmitted when St = 0. 

In general, the channel model may include fading and 
interference. Moreover the receiver may be on all the time 
(Sr = 1). For simplicity of exposition we postpone these 
additional considerations for later sections and start with the 
basic model of Fig. [2] which includes no fading or interference 
and where the receiver discards the information received when 
interference from primary user is present. 

III. Capacity with Dynamic Spectral Activity 

The coherence time T c determines the dynamic nature of the 
spectral environment. Since single letter capacity expressions 
are available for memoryless channels, we will start with 
the assumption that T c = 1 i.e., the extreme case of a 
dynamic environment where each channel use corresponds 
to an independent spectral usage scenario. Such a scenario 
could correspond to fast frequency hopping. T c = 1 is 
also an accurate predictor of the capacity limits of cognitive 
communications where the coding scheme does not utilize 
the channel memory (i.e. transceivers that are unaffected 
by interleaving). The case of T c > 1 introduces memory 
into the channel process. In this case, single letter capacity 
characterizations are not known. However, the channel can still 
be viewed as a memoryless channel over an extended alphabet 
X corresponding to T c channel uses [7]. With the extended 
alphabet, the i th channel use corresponds to the transmitted 
symbol X(i) = {X(iT c ),X{(iT c + 1), • • • , X(iT c + T c - 1)}. 
With the extended alphabet the channel is memoryless and the 
capacity expressions provided below for T c = 1 are directly 
applicable when we replace the input and output symbols with 
their corresponding T c channel use extensions. Both cases of 
T c = 1 and T c > 1 are considered in Section fvTTl 

IV. Capacity with Distributed Spectral Activity 

Capacity with distributed side information at the transmitter 
and receiver has been studied for the two different cases of 
causal and non-causal side information at the transmitter. Un- 
like transmitter side information, for receiver side information 



it does not matter whether it is obtained causally or non- 
causally because the receiver can always wait till the end to 
decode everything. The corresponding capacity expressions for 
causal and non-causal side information at the transmitter are 
presented in this section. 

A. Capacity with Non-Causal Side Information - Frequency 
Coding 

For cognitive radio non-causal side information arises when 
the coding is performed in the frequency domain. The trans- 
mitter and receiver scan the spectral activity in a wideband and 
communicate opportunistically through a codeword spanning 
multiple frequency slots that are presumed to be idle. 

For the single user memoryless channel where i.i.d. side 
information St is available non-causally at the transmitter, 
the capacity is known to be [8]— [12]: 



where 



C 



St,Sr 



max I(U; Y, S R ) ~ I(U; S T ) 



(3) 



where K, raslll = {P(U,X\S T ) = P(U\S T )P(X\U, S T )}- U 
is an auxiliary random variable, X and Y are the input and 
output alphabet, and S R is i.i.d. side information available at 
the receiver. 

Comparing this to the case where no side information is 
available {St = <fi), 

C^s R = max I(U; Y, S R ) = max I(X; Y. S R ), (4) 

P(U,X) P(X) 

note that the availability of side information at the transmitter 
is helpful in that the transmitter can match its input to 
the channel information by picking the input alphabet U, X 
conditioned on St, as opposed to where the input can 
not be matched to the channel state. However, the benefit of 
matching the input to the channel state comes with the cost 
of the subtractive term in (3}, i.e., I(U; St) which can be 
interpreted as the overhead required to inform the receiver 
about the adaptation to the channel state at the transmitter. 

B. Capacity with Causal Side Information - Temporal Coding 

The case of causal side information represents coding in the 
time domain. The transmitter and receiver monitor the primary 
users' activity in a specific frequency slot and opportunistically 
use it for secondary communications when it is perceived to 
be idle. 

For the case where the side information is available at the 
transmitter only causally, the capacity expression has been 
found by Shannon [13] as 



C 



s T ,s R =™* I (T;Y 1 S B ) 



(5) 



where T is an extended alphabet of mappings from the channel 
state St to the input alphabet X. 

Recent work has also presented an alternate form for the 
capacity with causal side information that is directly related 
to the corresponding expression for the non-causal side infor- 
mation case. 



/^non -causal 



max^^,, I(U; Y, S R ) - I(U; S T ), 
I(U;YS R )-I(U;S T ), 



= {P(U,X\S T ) = P(U\S)P(X\U,S T )} 
= {P(U, X\S T ) = P(U)P(X\U, S T )} 



In the non-causal case, the choice of U can be made con- 
ditional on the channel state St- In the causal case U is 
picked independent of St- This makes the subtractive term 
equal to zero for the causal case. In both cases, it suffices for 
the optimal input symbol X to be just a deterministic function 
off/, S T . 

V. Capacity of Two Switch Channel Model 

Unlike the direct maximization of mutual information 
I(X;Y) over the input alphabet distribution P(X) in the 
absence of side information, capacity characterization in the 
presence of side information is more involved because of the 
additional auxiliary random variable U. However, as we show 
in this section, for the two switch channel model of Fig. |2] 
a direct capacity characterization may be obtained for both 
causal and non-causal side information. 

A. Causal Side Information at the Transmitter 

Theorem 1: For the two switch channel model of Fig. [2] 
and causal side information at the transmitter coding can be 
performed directly on the input alphabet (i.e., U = X) and 
the channel capacity is 



CsT, Sr (P) 



maxI(X;Y,S R ) 
p(x) v HJ 



where the channel input X satisfies the power constraint 

P 



E[\X\ 



Prob(S T = 1) 

Proof: For causal side information at the transmitter, 
we know from the previous section that the input X is a 
deterministic function of the auxiliary random variable U 
and the transmitter side information, i.e. X = /(£/, St)- 
Equivalently, let 



X 



h(U), S T = 1, 
f (U), S T = 0. 



Then the capacity is unchanged if we instead assume 

X = /!([/), S T = 0,1. 

The reason is that the channel output is unaffected by X when 
the switch is open (St — 0). Therefore, the transmitted symbol 
X is inconsequential for St = 0. Now, according to (|6j X 
is a deterministic function of the auxiliary random variable 
U regardless of the channel state. Therefore, decoding U is 
the same as decoding X and without loss of generality we 
can replace U = X in the capacity expressions. Furthermore, 
since U is independent of St for causal side information, the 
input X is also independent of St- Therefore, the capacity is 



the same as when the side information is not available to the 
transmitter and the power constraint (|2ji can be expressed as 

21 



E[\X\ 2 S T ] 
^E[|X| 2 ] 



E[\Xf]E{S T } = P 
P 



Prob(5 T = 1) 

This completes the proof. 

Theorem [2 leads directly to the following corollary. 
Corollary 1: 



C ST,S R ( P ) = C <t>;S R 



P 



i Prob(5 T = 1), 

The capacity of the two switch channel model with causal 
transmitter side information and transmit power P is the 
same as the capacity with no transmitter side information and 
transmit power where St = E[St] is the probability that 
the switch St = 1, i-e., the average fraction of spectrum that 
is sensed to be idle at the transmitter. 

Corollary ^ shows that for the two switch channel model of 
Fig. |2 there is no benefit of causal side information at the 
transmitter except the power saving that can be achieved by 
not transmitting when St — 0. In particular, the optimal 
codebooks are the same whether the transmitter knows the 
switch state St progressively (causal) or not at all. The only 
advantage of knowing the switch state is that a higher transmit 
power codebook may be picked and the power saved by 
replacing the codeword symbol with X = whenever St = 0. 

Finally, note that although the channel is an additive white 
Gaussian noise channel, the optimal input distribution is not 
necessarily Gaussian because of the switch St whose state 
is not known to the receiver. Input optimization requires the 
entropy maximization of a mixture process, consisting of the 
AWGN and the transmitted symbol X. Gaussian mixture 
models are traditionally used for classification in pattern 
recognition literature based on the maximum entropy principle. 
Therefore, we will use a Gaussian input distribution for our 
capacity calculations. Note that Gaussian inputs may not be 
strictly optimal, but they do represent an innerbound that we 
expect to be fairly tight based on the corresponding results in 
entropy maximization with Gaussian mixture processes [14]. 

B. Non-Causal Side Information at the Transmitter 

Theorem 2: For the two switch channel model of Fig. 2 
and non-causal side information at the transmitter, coding can 
be performed directly on the input alphabet (i.e., U — X) and 
the channel capacity is 



C n on -causal 



-(P) = max I(X;Y,S R )-I(X;S T ) 



with the power constraint E[|X| 2 S't] = P. 
The proof proceeds as in the causal case so we have X = 
fi(U), St — 0,1 and therefore we can set X = U. However, 
there is a difference. In the non-causal case, U depends on St 
and therefore, the input X is not independent of the transmitter 
side information. Interestingly, even though the output is not 
affected by the transmitted symbol when St = 0, the choice 



of the input distribution P(X\St — 0) does affect the mutual 
information I(X; Y, Sr). Therefore, one cannot arbitrarily set 
P{X\S T = 0) = P(X\S T = 1) as in the causal case. 
Another observation that can be made is the analogy between 
the current problem and the memory with stuck-at defects 
explored in [9], [10]. The open switch state is similar to the 
stuck-at defect because when St = the output is independent 
of the input, i.e. P(Y\X, S T = 0) = P(Y\S T = 0). 

Based on the similarity of our two switch channel model 
and the memory with stuck-at defects explored in [10] and the 
optimal input distribution presented in [10] we pick the input 
distribution as: 

X = AT(0,P), S T = 1 
X=Af(0,a), S T = 0- 

and we optimize over a. Again, this choice of input distribu- 
tion is an innerbound on capacity. 

Next we present several capacity outerbounds which com- 
plement the results presented above as well as represent the 
precise capacity under some additional assumptions. 

VI. Capacity Outerbounds 

A. Capacity with Global Side Information: 

Global side information refers to the case where both St 
and Sr are known to both the transmitter and the receiver. In 
this case the transmitter and the receiver know exactly which 
spectral holes are unused at both ends of the cognitive radio 
link. Therefore, the transmission is restricted to the slots where 
St — 1) Sr = 1. Gaussian inputs are optimal in this case and 
the capacity expression is: 

P 



C*,*(P) =VT0b(S T S R = l)log 1 



Pmb(S T SR = 1) 



Notice that in this case, capacity with causal side information 
is identical to capacity with non-causal side information. 

B. Capacity with Full Side Information at Receiver: Cs Tl * 

Full side information at the receiver refers to the case where 
both St and Sr are known to the receiver. The transmitter is 
still assumed to know only St- In previous work [15] we 
have shown that when the transmitter side information is also 
available at the receiver, the capacity is the same with causal 
or non-causal side information at the transmitter. In this case 
also, Gaussian inputs are optimal and the capacity is given by: 

P 



C St AP) = PioHStSr = 1) log 1 



Prob(S T = 1), 

C. Capacity with Full Side Information at Transmitter: C+^Sr 

Full side information at the transmitter refers to the case 
where both St and Sr are known to the transmitter. The 
receiver is still assumed to know only St- This is of practical 
significance since it represents the capacity with a channel 
state feedback channel from the receiver to the transmitter. 
Through this feedback channel the transmitter can potentially 
learn the instantaneous channel state at the receiver and adapt 



Model 1 (Y = S D ( S T X + N)) with u = 0.9 and u = 0.9 

HI ! 2 




Correlation coefficient p 

Fig. 3. Capacity of the two switch model in a lightly loaded system 

its transmit strategy accordingly. The following relationship is 
easily obtained for the causal case: 

c^ R (P) = csZs R (p Ioh{SR = llST = 1) ) 

The corresponding expression for the non-causal case is 
C—'(P) = max I(X;Y\S R )-I(X;S T \S R ) 

P(X\bT,JR) 

with the power constraint E[|X| 2 Sr] = P. 

VII. Numerical Results 

To gain numerical insights we compute the capacity ex- 
pressions presented in the preceding sections. Figures [3] and 
|4] represent cognitive communication scenarios where, on the 
average, active primary users occupy 10% (lightly loaded 
system) and 90% (heavily loaded system) of the available 
spectrum, respectively. Thus, for the basic model of the 
lightly loaded and heavily loaded scenarios correspond to 
E[St] = E[Sr] = 0.9 and 0.1, respectively. For transmit 
power of 10 dB, The average throughput is plotted as a 
function of the correlation coefficient p between St and Sr. 
As mentioned before, p quantifies the distributed nature of the 
spectral activity. 

Several interesting observations can be made from these 
plots. First, unlike the case of the memory with stuck-at 
defects considered in [10], non-causal side information does 
not appear to offer a significant advantage over causal side 
information for the input distributions considered. This ad- 
vantage disappears even further for an overpopulated system. 
Second, while Cs T .+ dominates C+ y s a m an underpopulated 
system, the opposite is true in an overpopulated system. This 
suggests that transmitter side information is more valuable 



Model 1 (Y = S D ( S T X + N)) with |i. = 0.1 and n„ = 0.1 

nil c. 




Correlation coefficient p 

Fig. 4. Capacity of the two switch model in a heavily loaded system 

in an overpopulated system while receiver side information 
is more valuable in an underpopulated system. Thirdly, the 
plots reveal how the dynamic nature of the spectral activity 
affects the capacity. Intuitively one would expect that as the 
channel coherence time increases the receiver will acquire 
more knowledge of the transmitter state St- Mathematically, 
we expect that as T c -> oo, Cg^(P) -> Cg^ ai (P). 
However, the plots indicate that the convergence is very rapid. 
Even with a channel coherence time as small as 3 channel 
symbols, the capacity Cs""<j a/ (P) is approximately the same 
as with perfect knowledge of the communication opportunities 
at the receiver. 

The rapid convergence of Cg^£* { (P) to C c s a ^l al {P) is a 
positive indicator of the capabilities of cognitive radio systems 
under highly dynamic spectral environments. Cognitive radio 
is based on the premise of minimal interference to the primary 
users of the spectrum. In order to ensure this, the cognitive 
communications must occur over short bursts, allowing the 
secondary users to frequently check for primary users' activity 
and to jump out of the active bands as soon as the primary 
users' presence is detected. If the primary users' spectral 
activity is highly dynamic, the length of the opportunistic 
communication bursts must be very small to ensure minimal 
interference. The numerical results show that even with bursts 
as small as 3 — 5 channel uses, the capacity achieved is as high 
as with full knowledge of the spectral holes at the receiver. 
The numerical results indicate that cognitive communication is 
fairly robust to the dynamic nature of the spectral environment. 

Finally, note that while the capacity benefits of sharing the 
transmitter side information with the receiver are automatically 
obtained for T c > 3, the additional benefits of sharing the 
receiver side information with the transmitter are significant 



and not automatically obtained. In other words, overhead 
information from the receiver to the transmitter improves 
capacity significantly but the overhead of information flow 
from the transmitter to the receiver presents no benefit when 
communication bursts last longer than 3 channel symbols. 

VIII. Extensions 

For simplicity, we presented our results for the two switch 
model of Fig.|2] The analysis as well as the numerical insights 
of the previous sections extend to more complex models 
when we incorporate additional considerations such as fading, 
interference and channel knowledge. For want of space, we 
will briefly summarize some of these extensions and leave a 
detailed exposition to [16]. 

For a general model we include channel fading and inter- 
ference. We also allow the receiver to process all received 
signals. Notice that unlike the transmitter which needs to shut 
off transmission to avoid interference to the primary users, the 
receiver can stay on all the time. While this can increase the 
signal processing required at the receiver, it represents a more 
capable system with a higher capacity. Such a system can be 
represented by the following system model: 



Model 2 (Y = S T X + S R N) with ^=0.9, u 2 =0.9 and s<, 1) =1 , s<, 2) =3.1623 

3.25 1 1 , , 1 1 , , 1 r- 



Y = S T X + S R N, 



(6) 



where, Y, X, St and N are defined as before. However, 
instead of a binary switch, Sr represents a general channel 
state. Notice that both the channel fade and the interference 
(assumed Gaussian) power can be included into the AWGN 
normalization factor Sr. A high value of Sr represents 
poor communication conditions, either because the channel 
is severely faded or because the interference from the active 
users near the receiver is too strong. Note that we do not 
allow the cognitive radio receiver to decode and subtract the 
interference. As before, the assumption is that St is known 
only to the transmitter and Sr is known only to the receiver. 

The numerical results for the general model are shown in 
Fig-ID F° r the pl ot we have assumed a two state channel with 
a good and bad state corresponding to SNR of 10 dB and 
dB respectively. If we choose bad channel as —10 dB, the 
performance is identical to the two switch model. This shows 
that the receiver can ignore the received signal in bad SNR 
conditions without a significant capacity loss. More elaborate 
fading models lead to similar results and are included in [16]. 

IX. Conclusion 

We explore the capacity of a cognitive radio system when 
the communication opportunities detected at the secondary 
transmitter and receiver are correlated but not identical. The 
problem is formulated as communication with distributed and 
time varying side information at the transmitter and receiver. 
We develop a two-switch model for which we present capacity 
expressions as well as inner and outer bounds. Using the ca- 
pacity expressions we determine the value of side information 
and the necessity of the overhead associated with a handshake 
between the transmitter and receiver to initiate opportunistic 
communication. 
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Fig. 5. Capacity of the general model in a lightly loaded system 
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